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f RELATED APPLICATIONS 

£&$f|^ This application is related to the following copending application: 

■ U.S. Patent Application Serial No. 09/888,374, filed June 22, 2001 , 
entitled "Content Delivery Network Using Differential Caching." 
This application is hereby incorporated by reference as if fully set forth herein. 

BACKGROUND OF THE INVENTION 

Field of the Invention 

[0002] This invention relates to reducing the bandwidth and computing resources 
required to transmit a web page over a network. 

Related Art 

[0003] It is desirable to transmit web pages in such a way as to minimize the bandwidth 
and other computing resources required to transmit the web page from a server 
device to a client device. 

[0004] One popular technique for minimizing bandwidth requirements and computing 
resources is to store the web pages at multiple mirroring servers throughout the 
network. By "pushing content to the edge of a network" (that is, caching the 
content and serving it from a mirroring server), it is possible to minimize the 
distance (that is, geographical distance or as measured by network topology) that 
information must travel before reaching a destination client device. Although 
redistribution of the load among multiple mirroring servers improves the quality of 
service by minimizing distance, it does not necessarily affect the bit rate at which 
a web page is transmitted. 

[0005] Another technique for minimizing bandwidth requirements involves compression 
of the web page. Compression generally involves the use of a computer 
program such as gzip, glib or some other similar program. Since HTML is very 
compressible (for example, the use of glib can result in 73.8 % compression of 
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HTML), there is a significant minimization in the amount of bandwidth required for 
transmission of web pages. One drawback to this technique however, is that 
compressing each web page is a relatively inefficient use of computing 
resources. Every time a web page is sent to a user, the page must be 
compressed, regardless whether the page was compressed in the past. For 
example, every time an organizational home page is sent from a server to a 
client device, computing resources are devoted to compression of that home 
page. In a second example, every time a server sends similar pages (such two 
different instances of an order form from an on-line retailer), those pages must be 
separately compressed without regard for their similarity. 

[0006] Accordingly, it would be desirable to provide a technique for serving relatively 
non-static content for delivery in a content delivery network. 

SUMMARY OF THE INVENTION 

[0007] In a first aspect of the invention, a template for a web page or for a set of web 
pages is identified and compressed at an originating server. A template can 
include either (1) an entire web page or (2) those elements in a web page that 
are relatively unchanging. Examples of templates include: 

■ Entire web pages 

■ Order form pages without the personalized information 

■ Stock update pages without the ticker information 

■ Product specs without the product descriptors 

■ Ticket sale pages, without information about a particular event 

■ Portions of comparable web pages wherein a portion of the web page 
does not change frequently. 

[0008] A library of these compressed templates is stored at the originating server and 
the mirroring servers so as to make it available for subsequent requests. 
Generally, compression of the template only occurs once. The template does not 
need to be compressed again, unless changes to the template itself are made 
(for example, changes to the style of the template). In this respect, the invention 
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is very different from the prior art of HTML compression which requires 
compression of a web page every time the page is sent. By minimizing the 
number of times that compression takes place, fewer computational resources 
need be allocated toward compression operations. In a preferred embodiment, 
the compressed template is cached and served from a remote mirroring server. 
In other embodiments, the compressed template is cached and served from the 
originating server. When a client device requests a web page, the compressed 
template is sent from either the mirroring server (if the template is present) or 
from the originating server (if the template is not present at the mirroring server) 
by way of the mirroring server to the client device. 

[0009] In a second aspect of the invention, delta information from a set of web pages is 
identified and compressed at the originating server. Unlike templates which 
include substantial similarities, delta information refers to a set of differences 
such as differences between instances of a particular web page. For example, in 
a web page of a completed order form, the web page may be divided into (1 ) a 
template, including the unchanging features of the page (or the entire page if 
there is no dynamic information in the page); and (2) delta information comprising 
specific personalized information pertaining to an individual order. In a preferred 
embodiment, the delta information is compressed at the originating server and 
sent from the originating server to the client. In a preferred embodiment, delta 
information is compressed each time is it transmitted. Given that the delta 
information is usually relatively small, the amount of computational resources 
devoted to its compression is also relatively small. This selective compression 
minimizes the bandwidth needed to transmit the page while simultaneously 
minimizing the computing resources allocated for compression. In other 
embodiments, the delta information may be compressed once and cached until 
such time that the compressed delta information is reused. This may be useful 
when the delta information includes advertising copy that may be repeated again. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

[0010] Figure 1 shows a block diagram of a system for efficient compression using 
differential caching. 

[001 1] Figure 2 shows a flow diagram of a method for efficient compression using 
differential caching. 

[0012] Figure 3 shows a data flow diagram in a system for efficient compression using 
differential caching. 

[0013] Figure 4 shows a data flow diagram for efficient compression using differential 
caching using a proxy encoder server. 

DESCRIPTION OF THE PREFERRED EMBODIMENT 

[0014] In the following description, a preferred embodiment of the invention is described 
with regard to preferred process steps and data structures. Those skilled in the 
art would recognize after perusal of this application that embodiments of the 
invention can be implemented using one or more general purpose processors or 
special purpose processors or other circuits adapted to particular process steps 
and data structures described herein, and that implementation of the process 
steps and data structures described herein would not require undue 
experimentation or further invention. 

Lexicography 

[001 5] Originating Server - as used herein, an "originating server" takes on the role of a 
server in a client-server relationship and is the original provider of content to a 
client device or to a mirroring server. 

[0016] Mirroring server - as used herein, a "mirroring server" includes any device that 
takes on the role of a server in a client-server relationship and that receives 
requests from client devices and responds to those requests by sending content 
that originated (in whole or in part) at an originating server. 
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[0017] Template - as used herein, the term "template" refers to a selected portion of a 
web page that is relatively unchanging. If there is no difference between different 
instances of a web page, then the entire page may be a template. 

[0018] Delta information - as used herein, the term "delta information" refers to a 
selected portion of a web page varies between instances of the web page. 

System Elements 

[0019] Figure 1 shows a block diagram of a system for efficient compression using 
differential caching. 

[0020] A system for efficient compression using differential caching (shown by general 
character reference 100) includes one or more client devices 1 10 under the 
control of user 1 12, an originating server 120, a set of mirroring servers 130 and 
a communication network 140. A proxy encoder server 150 may be positioned 
"in front" of the originating server 120. 

[0021] Each client device 110 includes a processor, an input element, a presentation 

element, a local memory and system software. Client devices 110 further include 
software 1 14 disposed for communicating with the communication network 140 
and software 1 16 for integrating a web page. 

[0022] In a preferred embodiment, each client device 1 10 includes a general-purpose 
computer, such as a laptop or workstation. However, a client device 110 can 
also include (either alone or in conjunction with a laptop or workstation), a hand- 
held calendar (such as a "Palm Pilot" or other hand-held device), a portable 
computer, a special purpose computer, a cellular telephone or other telephonic 
device, a web server acting as the agent for a user, or another device. In 
alternative embodiments, a client device 110 may also include any other device 
disposed for performing all or some of the functions described herein. 

[0023] The software 1 16 for integrating a web page includes elements for 

decompressing a template, decompressing delta information, and integrating the 
template and delta information into a unified presentation for the user 112. 
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[0024] In some embodiments, the client device 110 also includes a decoder 118, 
preferably as a browser "plug-in". However, the decoder 118 may also be 
situated in other locations downstream from the originating server 120, such as 
at a cache or firewall associated with an ISP or at the edge of an enterprise 
network. The decoder 118 specifies that it accepts delta encoding by adding 
information to the HTTP header In such embodiments, the decoder 118 may 
also perform the functions performed by software 114, such as integrating 
information. 

[0025] Those embodiments of the invention that do not include a decoder 1 18 are 

known as "clientless". In such instances, the server sees that the request does 
not come from a decoder 118 and serves deltas in the form of javascript 
instructions and a reference to the template. The scripting capabilities of the 
client's browser are directed toward applying the delta to the template and 
displaying the HTML page. In other embodiments, other scripting techniques 
may also be used. 

[0026] The originating server 120 is under the control of an entity that provides web 

pages for users 112. Similar to the client devices 110, the originating server 120 
includes a processor, an input element, a presentation element, system software 
and a local memory. However, unlike the client device 110, the originating server 
120 includes a cache of the complete set of compressed web page templates 
122, a cache of delta information 124, compression software 126 (for example 
gzip, glib or another comparable program) and web server software 128. The 
compression software 126 is used to compress both template information and 
delta information. 

[0027] The compressed templates included in the cache of the complete set of 

compressed web page templates 122 are derived from web pages that include 
relatively unchanging elements, such as the backdrop in weather pages, blank 
charts used in stock pages, incomplete order form pages and similar web pages 
with relatively unchanging elements. If a web page does not include dynamic 
elements, the compressed template may include the entire web page. In a 
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preferred embodiment, the template is compressed and cached with the 
complete set of compressed web page templates 122. It does not have to be 
compressed again unless the information itself is updated (for example, with 
stylistic changes). 

[0028] The delta information in the cache of delta information 124 includes those 

portions of a set of web pages that are highly variable. This delta information can 
be derived from web pages that include elements that change frequently such as 
a weather forecast, the value of a stock, information to complete an order form 
and comparable information that is either very ephemeral or unique to a 
particular user 112. If the total number of sets of delta information 124 
associated with a particular URL is relatively low (for example, a few different 
rotating banner advertisements) and that information is relatively small, the 
server can maintain a table mapping uncompressed deltas to compressed deltas. 
This saves computing resources because there is no need to re-compress the 
delta information every time it is served. Optimal benefits are obtained if the 
delta information is requested many times before the template changes. 

[0029] The set of mirroring servers 130 is usually under the control of the same entity 
that controls the originating server 120. Similar to the originating server 120, the 
set of mirroring servers 130 includes a processor, system software, an input 
element, a presentation element, a local memory, a cache of compressed web 
page templates 132, compression software 134, and web server software 136. 
However, unlike the originating server 120, the cache of compressed web page 
templates 132 at the mirroring server 130 is not necessarily complete. Moreover, 
compression software 134 at the mirroring server 130 is generally used to 
compress delta information rather than templates as does the compression 
software 126. In a preferred embodiment, the mirroring servers are relatively 
more local to the client devices 1 10 than the originating server 120. 

[0030] As noted supra, some embodiments of the system 100 also include a proxy 
encoder server 150. In these embodiments, the proxy encoder server 150 is 
coupled to the originating server 120, either as a separate server or as an 



-7- 



encoder coupled to the originating server. The proxy encoder server 1 50 
compares information stored locally or at the client to possibly fresher 
information from the mirroring server 130 or the originating server 120 and 
serves the compressed template and delta information to the client device 110. 

[0031] The communication network 140 is disposed for transporting compressed 
templates, delta information and requests for web pages between the client 
devices 110, the originating server 120 and the mirroring server 130. In a 
preferred embodiment, the communication network 140 includes a packet 
switched network such as the Internet, as well as (in conjunction with or instead 
of) an intranet, an enterprise network, an extranet, a virtual private network, a 
virtual switched network, or a wireless network. In alternative embodiments, the 
communication network 140 may include any other set of communication links 
that couple the client devices 1 10 the originating server 120 and the mirroring 
servers 130. 

Method of Operation 

[0032] Figure 2 shows a flow diagram of a method for efficient compression using 
differential caching. 

[0033] A method 200 for efficient compression using differential caching is performed by 
the system 100, including a set of client devices 1 10, an originating server 120, a 
set of mirroring servers 130 and a communication network 140. 

[0034] Although described serially and in a particular sequence, in a preferred 

embodiment the steps described herein can be performed concurrently or in 
parallel by the system elements, or could be performed in a different sequence or 
some combination thereof. 

[0035] At a flow point 205, a user 1 12 is ready to request a web page from a mirroring 
server 130. In a preferred embodiment, the mirroring server 130 is relatively 
more proximate to the user 112 than the originating server 120. 
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[0036] At a step 210, the user 1 12 causes the client device 1 10 to generate a request 
message to the mirroring server 130 for an identified web page. In a preferred 
embodiment, each request for a web page is performed independently, even if a 
plurality of requests are to be performed substantially simultaneously. 

[0037] In other embodiments, the client device 110 may request the web page from the 
originating server 120. Bypassing the mirroring server 130 may save bandwidth 
if the originating server 120 is closer to the user 112 than the mirroring server 
130. 

[0038] When the request is made, the client device 110 indicates whether it can receive 
delta information. If the client device 110 can receive delta information, the 
method proceeds with step 215. 

[0039] At a step 21 5, the mirroring server 1 30 determines if a compressed template for 
the web page is available in the cache of compressed web page templates 132. 
If necessary, the mirroring server 130 obtains the compressed template from the 
originating server 120. If the compressed template is unavailable from the 
originating server 120 (this may be the case if the template has not been 
requested before, or if the template has been recently changed), a template is 
created and compression software 126 at the originating server 120 compresses 
the template. The newly compressed template is sent from the originating server 
120 to the mirroring server 130 where it is stored in the cache of compressed 
web page templates 132. 

[0040] At a step 220, the mirroring server 1 30 transmits the compressed template to the 
client device 110. 

[0041] If the mirroring server 130 were bypassed in step 210, the compressed template 
is sent to the client device 110 from the originating server 120. 

[0042] At a step 225, the client device 1 10 or the mirroring server 130 (depending upon 
the configuration of the system 100) transmits a message to the originating 
server 120 for the delta information. 
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[0043] At a step 230, the originating server 120 identifies the delta information in the 
cache of delta information 124 and compresses it using the compression 
computer program 126. Depending on the size and type of delta information 124, 
this compressed delta information may be stored at the originating server 120 or 
the mirroring server 130. This is particularly useful if there is a strong likelihood 
that the delta information will be requested again. 

[0044] At a step 235 the originating server 120 transmits the compressed delta 
information to the client device 110. 

[0045] At step 240 the client device 110 integrates the template and delta by performing 
the following substeps: 

■ At a substep 240(a), the template is decompressed. 

■ At a substep 240(b), the delta information is decompressed. 

■ At a substep 240(c), the decompressed template and decompressed 
delta are integrated so as to form a complete web page that is presented to the 
user 112. 

[0046] Given that the software 1 16 for integrating a web page is present at the client 
device 110, the resources required to decompress the delta information and 
template and integrate them do not contribute to the total resources required by 
the servers to compress and serve the web page. 

[0047] In other embodiments, the system 100 does not include a decoder 118. These 
embodiments are known as "clientless", because many of the functions normally 
performed by the decoder 1 18 are performed differently. In such embodiments, 
the originating server 120 identifies that the client can receive a delta (e.g. via a 
cookie), and serves a delta instead of the document. The delta contains a 
reference to the template, which can be served from either the mirroring servers 
130 or from the originating server 120. In such embodiments, the template is 
compressed once and cached at the mirroring server 130. 

[0048] Figure 3 shows a data flow diagram in a system for efficient compression using 
differential caching. 
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[0049] The system 300 includes a set of data flows for sending and receiving 

information between the client devices 110, the originating server 120 and the 
mirroring server 130, using the communication network 140. It should be noted 
that it is not necessary to exhaust every data flow to achieve efficient differential 
caching. 

[0050] A data flow 310 includes messages sets between the client device 110 under the 
control of the user 112 and the mirroring server 130. 

[0051] Message set A from the client device 1 10 to the mirroring server 130 includes 
requests for web pages. 

[0052] Message set B from the mirroring server 130 to the client device 110 includes the 
following: 

■ compressed templates 

■ compressed delta information. 

[0053] A data flow 320 includes messages sets between the mirroring server 130 to the 
originating server 120. 

[0054] Message set C from the mirroring server 130 to the originating server 120 
includes the following: 

■ requests for compressed templates that are not available in the cache of 
compressed web page templates 132 

■ requests for delta information if the delta information is not available at 
the mirroring server 130. 

[0055] Message set D from the originating server 120 to the mirroring server 130 
includes the following: 

■ compressed templates 

■ compressed delta information. 

[0056] A data flow 330 includes messages sets between the originating server 120 and 
the client device 110. 
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[0057] Message set E from the client device 1 10 to the originating server 120 includes 
the following: 

■ requests for compressed templates 

■ requests for compressed delta information. 

[0058] Message set F from the originating server 120 to the client device 110 includes 
the following: 

■ compressed templates 

■ compressed delta information. 

[0059] Figure 4 shows a data flow diagram for efficient compression using differential 
caching using a proxy encoder server. 

[0060] A method 400 for efficient compression using differential caching is performed by 
the system 100, including a set of client devices 1 10, an originating server 120, a 
set of mirroring servers 130, a proxy encoder server 150 and a communication 
network 140. 

[0061] Similar to method 200, the steps described herein can be performed concurrently 
or in parallel by the system elements, or could be performed in a different 
sequence or some combination thereof. 

[0062] At a flow point 405, a user 1 12 is ready to request a web page from a mirroring 
server 130. In a preferred embodiment, the mirroring server 130 is relatively 
more proximate to the user 112 than the originating server 120. 

[0063] In a step 410, the user 112 requests the web page. In a preferred embodiment, 
this request is made using the decoder 118. The decoder 118 intercepts the 
request and redirects it to the proxy encoder server 150. The decoder 118 also 
informs the proxy encoder server 150 that the client device 110 can receive delta 
and template information, and provide delta encoding. 

[0064] In other embodiments, the request can be made without using the decoder 118. 
In these other embodiments, the request goes directly to the proxy encoder 
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server 150. The absence of a decoder 118 indicates that the client cannot 
provide delta encoding. 

[0065] In a step 41 5, the proxy encoder server 1 50 obtains the web page or a template 
corresponding to the web page from the mirroring server or originating server. 
The proxy encoder server 150 compares the web page or template with 
information in the cache of compressed web page templates 132 (as noted 
supra, in such embodiments, these caches are local to the proxy encoder server 
150). If there is not a corresponding template in the cache of compressed web 
page templates 132 (as may be the case if the web page was never requested 
before), then a new template is generated, compressed and cached. If there is a 
corresponding template in the cache of compressed web page templates 132 
and that corresponding template needs to be updated, then updating is 
performed at this time and the updated template is compressed and cached. 
The compressed template may be tagged with information specifying a version 
number associated with the template. 

[0066] In those embodiments that do not provide delta encoding, the compressed 

template may be stored at the client device 110. In such embodiments, the proxy 
encoder server 150 calculates the difference between the compressed template 
at the client device 110 and the fresh page and calculates a delta. The delta is 
an HTML page that includes a reference to a compressed template (preferably a 
Javascript) and some Javascript instructions. In this embodiment, the Javascript 
instructions tell the client device 110 how to transform the template into the 
updated web page. 

[0067] In a step 420, the proxy encoder server 150 sends the compressed template to 
the client. This compressed template may be stored at the client device 110. 

[0068] The following steps occur when the user 1 1 2 requests the same web page 
associated with the compressed and cached template: 

[0069] In a step 425, the user 112 requests a web page using the decoder 118. In 
making the request, the decoder 118 also specifies what version of a 
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compressed template for the web page has been received in the past. This 
request is directed to the proxy encoder server 150. 

[0070] In a step 430, the proxy encoder server 150 receives the request and identifies a 
compressed template that is responsive to the request. The proxy encoder 
server 150 calculates the difference between the version of the compressed 
template and a presumably fresher version of the web page that is obtained from 
the originating server 120 or the mirroring server 130. This difference is the delta 
information. 

In step 435, the proxy encoder server 150 sends delta information such as is 
responsive to the request. In some embodiments, the delta information is 
compressed before sending. In other embodiments, the delta information is 
compressed and cached. 

In the "clientless" version, the proxy encoder server 150 does not know what 
version of the template is at the client. The proxy encoder server 1 50 makes this 
decision and instructs the client to use a specific version of the template. This is 
comparable to steps 310 - 335. 

Alternative Embodiments 

[0073] Although preferred embodiments are disclosed herein, many variations are 

possible which remain within the concept, scope and spirit of the invention; these 
variations would be clear to those skilled in the art after perusal of this 
application. One such variation includes caching (either at the originating server, 
mirroring server or proxy encoder server) a Huffman tree corresponding to a web 
page that has been requested in the past. In such alternative embodiments, the 
delta is calculated by comparing the Huffman tree to newer versions of the tree 
and computing a delta based upon those parts of the tree that have changed. 
This technique is preferable for web pages that are requested very frequently (or 
that change very rarely). In other embodiments, a Huffman tree corresponding 
to a template is generated and served separate from the delta information. 
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